FIGURE 1A 

Sequence of cadherin 3 (GenBank accession number NP_001784) 
SEQ ID NO:1 

MGLPRGPLASLLLLQVCWLQCAASEPCRAVFREAEVTLEAGGAEQEPGQALGK 

VFMGCPGQEPALFSTDNDDFTVRNGETVQERRSLKERNPLKIFPSKRILRRHKRD 

WWAPISWENGKGPFPQRLNQLKSNKDRDTKIFYSITGPGADSPPEGVFAVEKE 

TGWLLLNKPLDREEIAKYELFGHAVSENGASVEDPMNISIIVTDQNDHKPKFTQD 

TFRGSVLEGVLPGTSVMQVTATDEDDAIYTYNGVVAYSIHSQEPKDPHDLMFTI 

HRSTGTISVISSGLDREKVPEYTLTIQATDMDGDGSTTTAVAWEILDANDNAPM 

FDPQKYEAHVPENAVGHEVQRLTVTDLDAPNSPAWRATYLIMGGDDGDHFTITT 

HPESNQGILTTRKGLDFEAKNQHTLYVEVTNEAPFVLKLPTSTATIVVHVEDVNE 

APVFVPPSKVVEVQEGIPTGEPVCVYTAEDPDKENQKISYRILRDPAGWLAMDPD 

SGQVTAVGTLDREDEQFVRNNIYEVMVLAMDNGSPPTTGTGTLLLTLIDVNDHG 

PVPEPRQITICNQSPVRHVLNITDKDLSPHTSPFQAQLTDDSDIYWTAEVNEEGDT 

WLSLKKFLKQDTYDVHLSLSDHGNKEQLTVIRATVCDCHGHVETCPGPWKGG 

FILPVLGAVLALLFLLLVLLLLVRKKRKIKEPLLLPEDDTRDNVFYYGEEGGGEE 

DQDYDITQLHRGLEARPEWLRNDVAPTIIPTPMYRPRPANPDEIGNFIIENLKAA 

NTDPTAPPYDTLLVFDYEGSGSDAASLSSLTSSASDQDQDYDYLNEWGSRFKKL 

ADMYGGGEDD 



FIGURE 1B 

Sequence of matrix metalloproteinase 14 (GenBank accession number NP_004986) 
SEQ ID NO:2 

MSPAPRPPRCLLLPLLTLGTALASLGSAQSSSFSPEAWLQQYGYLPPGDLRTHTQ 

RSPQSLSAAIAAMQKFYGLQVTGKADADTMKAMRRPRCGVPDKFGAEIKANVR 

RKRYAIQGLKWQHNEITFCIQNYTPKVGEYATYEAIRKAFRVWESATPLRFREVP 

YAYIREGHEKQADIMIFFAEGFHGDSTPFDGEGGFLAHAYFPGPNIGGDTHFDSA 

EPWTVRNEDLNGNDIFLVAVHELGHALGLEHSSDPSAIMAPFYQWMDTENFVLP 

DDDRRGIQQLYGGESGFPTKMPPQPRTTSRPSVPDKPKNPTYGPNICDGNFDTVA 

MLRGEMFVFKERWFWRVRNNQVMDGYPMPIGQFWRGLPASINTAYERKDGKF 

VFFKGDKHWVFDEASLEPGYPKHIKELGRGLPTDKIDAALFWMPNGKTYFFRGN 

KYYRFNEELRAVDSEYPKNIKVWEGIPESPRGSFMGSDEVFTYFYKGNKYWKFN 

NQKLKVEPGYPKSALRDWMGCPSGGRPDEGTEEETEVIIIEVDEEGGGAVSAAA 

VVLPVLLLLLVLAVGLAVFFFRRHGTPRRLLYCQRSLLDKV 



FIGURE 1C (1) 

Sequence of cadherin EGF LAG seven-pass G-type receptor 2 (GenBank accession 
number NP_001399) 

SEQ ID NO:3 

MRSPATGVPLPTPPPPLLLLLLLLLPPPLLGDQVGPCRSLGSRGRGSSGACAPMG 

WLCPSSASNLWLYTSRCRDAGTELTGHLVPHHDGLRVWCPESEAHIPLPPAPEG 

CPWSCRLLGIGGHLSPQGKLTLPEEHPCLKAPRLRCQSCKLAQAPGLRAGERSPE 

ESLGGRRKRNVNTAPQFQPPSYQATVPENQPAGTPVASLRAIDPDEGEAGRLEYT 

MDALFDSRSNQFFSLDPVTGAVTTAEELDRETKSTHVFRVTAQDHGMPRRSALA 

TLTILVTDTNDHDPVFEQQEYKESLRENLEVGYEVLTVRATDGDAPPNANILYRL 

LEGSGGSPSEVFEIDPRSGVIRTRGPVDREEVESYQLTVEASDQGRDPGPRSTTAA 

VFLSVEDDNDNAPQFSEKRYVVQVREDVTPGAPVLRVTASDRDKGSNAWHYSI 

MSGNARGQFYLDAQTGALDWSPLDYETTKEYTLRVRAQDGGRPPLSNVSGLV 

TVQVLDINDNAPIFVSTPFQATVLESVPLGYLVLHVQAIDADAGDNARLEYRLAG 

VGHDFPFTINNGTGWISVAAELDREEVDFYSFGVEARDHGTPALTASASVSVTVL 

DVNDNNPTFTQPEYTVRLNEDAAVGTSW 

FSrrSQSGGGLVSLALPLDYKLERQWLAVTASDGTRQDTAQIVVNVTDANTHRP 

VFQSSHYTVNVNEDRPAGTTWLISATDEDTGENARITYFMEDSIPQFRIDADTG 

AVTTQAELDYEDQVSYTLAITARDNGIPQKSDTTYLEILVNDVNDNAPQFLRDSY 

QGSVYEDVPPFTSVLQISATDRDSGLNGRVFYTFQGGDDGDGDFIVESTSGIVRT 

LRRLDRENVAQYVLRAYAVDKGMPPARTPMEVTVTVLDVNDNPPVFEQDEFDV 

FVEENSPIGLAVARVTATDPDEGTNAQIMYQIVEGNIPEVFQLDIFSGELTALVDL 

DYEDRPEYVLVIQATSAPLVSRATVHVRLLDRNDNPPVLGNFEILFNNYVTNRSS 

SFPGGAIGRVPAHDPDISDSLTYSFERGNELSLVLLNASTGELKLSRALDNNRPLE 

AMSVLVSDGVHSVTAQCALRVTIITDEMLTHSITLRLEDMSPERFLSPLLGLFIQA 

VAATLATPPDHWVFNVQRDTDAPGGHILNVSLSVGQPPGPGGGPPFLPSEDLQE 

RLYLNRSLLTAISAQRVLPFDDNICLREPCENYMRCVSVLRFDSSAPFIASSSVLFR 

PIHPVGGLRCRCPPGFTGDYCETEVDLCYSRPCGPHGRCRSREGGYTCLCRDGYT 

GEHCEVSARSGRCTPGVCKNGGTCVNLLVGGFKCDCPSGDFEKPYCQVTTRSFP 

AHSFITFRGLRQRFHFTLALSFATKERDGLLLYNGRFNEKHDFVALEVIQEQVQL 

TFSAGESTTTVSPFVPGGVSDGQWHTVQLKYYNKPLLGQTGLPQGPSEQKVAW 

TVDGCDTGVALRFGSVLGNYSCAAQGTQGGSKKSLDLTGPLLLGGVPDLPESFP 

VRMRQFVGCMRNLQ\ODSRHE)MADFIANNGTVPGCPAKKNVCDSNTCHNGGT 

CVNQWDAFSCECPLGFGGKSCAQEMANPQHFLGSSLVAWHGLSLPISQPWYLSL 

MFRTRQADGVLLQAITRGRSTITLQLREGHVMLSVEGTGLQASSLRLEPGRAND 

GDWHHAQLALGASGGPGHAILSFDYGQQRAEGNLGPRLHGLHLSNITVGGIPGP 

AGGVARGFRGCLQGVRVSDTPEGVNSLDPSHGESINVEQGCSLPDPCDSNPCPA 

NSYCSNDWDSYSCSCDPGYYGDNCTNVCDLNPCEHQSVCTRKPSAPHGYTCEC 

PPNYLGPYCETRIDQPCPRGWWGHPTCGPCNCDVSKGFDPDCNKTSGECHCKEN 

HYRPPGSPTCLLCDCYPTGSLSRVCDPEDGQCPCKPGVIGRQCDRCDNPFAEVTT 

NGCEVNYDSCPRAIEAGIWWPRTRFGLPAAAPCPKGSFGTAVRHCDEHRGWLPP 

NLFNCTSITFSELKGFAERLQRNESGLDSGRSQQLALLLRNATQHTAGYFGSDVK 

VAYQLATRLLAHESTQRGFGLSATQDVHFTENLLRVGSALLDTANKRHWELIQQ 



FIGURE 1C (2), continued 



TEGGTAWLLQHYEAYASALAQNMRHTYLSPFTIVTPNIVISVVRLDKGNFAGAK 

LPRYEALRGEQPPDLETTVILPESWRETPPVVRPAGPGEAQEPEELARRQRRHPE 

LSQGEAVASVIIYRTLAGLLPHNYDPDKRSLRVPKRPnNTPVVSISVHDDEELLPR 

ALDKPVTVQFRLLETEERTKPICVFWNHSILVSGTGGWSARGCEWFRNESHVSC 

QCNHMTSFAVLMDVSRRENGEILPLKTLTYVALGVTLAALLLTFFFLTLLRILRS 

NQHGIRRNLTAALGLAQLVFLLGINQADLPFACTVIAILLHFLYLCTFSWALLEAL 

HLYRALTEVRDVNTGPMRFYYMLGWGVPAFITGLAVGLDPEGYGNPDFCWLSI 

YDTLIWSFAGPVAFAVSMSVFLYILAARASCAAQRQGFEKKGPVSGLQPSFAVLL 

LLSATWLLALLSVNSDTLLFHYLFATCNCIQGPFIFLSYVVLSKEVRKALKLACSR 

KPSPDPALTTKSTLTSSYNCPSPYADGRLYQPYGDSAGSLHSTSRSGKSQPSYIPF 

LLREESALNPGQGPPGLGDPGSLFLEGQDQQHDPDTDSDSDLSLEDDQSGSYAST 

HSSDSEEEEEEEEEEAAFPGEQGWDSLLGPGAERLPLHSTPKDGGPGPGKAPWPG 

DFGTTAKESSGNGAPEERLRENGDALSREGSLGPLPGSSAQPHKGILKKKCLPTIS 

EKSSLLRLPLEQCTGSSRGSSASEGSRGGPPPRPPPRQSLQEQLNGVMPIAMSIKA 

GTVDEDSSGSEFLFFNFLH 



FIGURE 1D 

Peptides for antibodies that bind to cadherin 3 (GenBank accession number NP_001784): 
RAVFREAEVTLEAGGAEQE (SEQ ID NO:4) 
QEPALFSTDNDDFTVRN (SEQ ID NO:5) 
QKYEAHVPENAVGHE (SEQ ID NO:6) 

Peptides for antibodies that bind to matrix metalloproteinase 14 (GenBank accession 
number NP_004986): 

AYIREGHEKQADIMIFFAE (SEQ ID NO:7) 
DEASLEPGYPKHIKELGR (SEQ ID NO:8) 
RGSFMGSDEVFTYFYK (SEQ ID NO:9) 

Peptides for antibodies that bind to cadherin EGF LAG seven-pass G-type receptor 2 
(GenBank accession number NP_001399): 

QASSLRLEPGRANDGDWH (SEQ ID NO:10) 
ELKGFAERLQRNESGLDSGR (SEQ ID NO:11) 
RSGKSQPSYIPFLLREE (SEQ ID NO:12) 

Peptides for antibodies that bind to cytokeratin 17: 
KKEPVTTRQVRTIVEE (SEQ ID NO:13) 
QDGKVISSREQVHQTTR (SEQ ID NO: 14) 
SSSIKGSSGLGGGSS (SEQ ID NO: 15) 






FIGURE 2 



Intrinsic Gene Subset 
Epithelial-Enriched Gene Subset 
FIGURE 4A 




FIGURE 4B 




FIGURE 4C 



FIGURE 5A 




FIGURE 5B 

FIGURE 6 



